Degraded Character Recognition

نویسنده

  • Céline THILLOU
چکیده

The DCR application for Degraded Character Recognition was developed for the DEA (Diplôme d’Études Approfondies)’s thesis. The main objective is to recognize characters, degraded by an acquisition from a lowresolution camera. Several steps are needed and are detailed in this thesis from the thresholding step, which converts a gray-level picture in a black and white one, to the post-correction step, which adds linguistic information to improve recognition results. Several sophisticated methods are used, such as wavelet decomposition or neural networks for different purposes. A different view is given here to understand how steps before the recognition part can induce changes on results.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multiple classifier for degraded machine printed character recognition Multiple classifier for degraded machine printed character recognition

The general problem of optical character recognition (OCR) remains a fundamental but not entirely solved issue in document analysis. In spite of significant improvements in the area of optical character recognition, the recognition of degraded printed characters, in particular, is still lacking satisfactory solutions. This paper presents an OCR method that combines the Hopfield network with a s...

متن کامل

Grayscale Feature Combination in Recognition based Segmentation for Degraded Text String Recognition

Grayscale feature is very effective for degraded character recognition. While many papers focus on different feature extraction algorithms on single character recognition, few deals with the impact of the selected feature on segmentation. For recognition-based segmentation, a good recognition performance on single character may not always have good performance on segmentation. In this paper, tw...

متن کامل

A Study of Touching Characters in Degraded Gurmukhi Text

Character segmentation is an important preprocessing step for text recognition. In degraded documents, existence of touching characters decreases recognition rate drastically, for any optical character recognition (OCR) system. In this paper a study of touching Gurmukhi characters is carried out and these characters have been divided into various categories after a careful analysis. Structural ...

متن کامل

Recognizing Degraded Handwritten Characters

In this report, a character recognition system is proposed that handles degraded manuscript documents which were discovered at the St. Catherine’s Monastery. In contrast to state-of-the-art Ocr systems, no early decision, namely the image binarization, needs to be performed. Thus, an object recognition methodology is adapted for the recognition of ancient manuscripts. Therefore, interest points...

متن کامل

Improved Degraded Document Recognition with Hybrid Modeling Techniques and Character N-Grams

In this paper a robust multifont character recognition system for degraded documents such as photocopy or fax is described. The system is based on Hidden Markov Models (HMMs) using discrete and hybrid modeling techniques, where the latter makes use of an information theory-based neural network. The presented recognition results refer to the SEDAL-database of English documents using no dictionar...

متن کامل

Degraded Document Analysis and Extraction of Original Text Document: An Approach without Optical Character Recognition

Document Image Analysis recognizes text and graphics in documents acquired as images. An approach without Optical Character Recognition (OCR) for degraded document image analysis has been adopted in this paper. The technique involves document imaging methods such as Image Fusing and Speeded Up Robust Features (SURF) Detection to identify and extract the degraded regions from a set of document i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004